Poster: rayd8 Date: Dec 27, 2004 9:23am
Forum: etree Subject: Disappearing Uploads

This is a recurring problem that I've tried to bring to a curator's attention before.

(old post) http://www.archive.org/iathreads/post-view.php?id=24716

A Radiators show, http://www.archive.org/audio/etree-details-db.php?id=20554
does not show up via the search engine, whether sorted by date:
http://www.archive.org/search.php?query=creator%3A%28Radiators%29&sort=-%2Fmetadata%2Fdate

by date added:
http://www.archive.org/search.php?query=creator%3A%28Radiators%29&sort=-%2Fmetadata%2Fpublicdate

or when searched by the specific date.

The only way it can be found is by the following:
Top Level > Audio > Live Music Archive > Radiators > 2004

I think there may be other shows missing like this as well...

-TIA
-Mike

This post was modified by rayd8 on 2004-12-27 17:21:10

This post was modified by rayd8 on 2004-12-27 17:23:01

Poster: dmoynihan Date: Dec 27, 2004 10:52am
Forum: etree Subject: Help! Dumas Works 07 is Missing Pages 556-557

Hey gang, over Christmas I had my laptop w/ me, so I took the page images from
http://www.archive.org/texts/texts-details-db.php?collection=toronto&collectionid=worksalexdumas07dumauoft

(the first of Dumas' Marie Antoinette saga), and proofed 'em all (passes the time. OK, I'm strange, shut up.)

Anyway, still in transit, I've just about got the thing done, but discover there's no pages 556-557 (image 581 is Page 555; image 582 is page 558).

Err, the book ends on image 590, so we're talking right there.
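
The gap check described above can be sketched in a few lines. This is only an illustration, assuming the image-to-page mapping has been noted by hand while proofing; the function name `find_missing_pages` is hypothetical and not part of any Archive tool, and the only data used are the two image/page pairs given in the post.

```python
def find_missing_pages(image_to_page):
    """Return printed page numbers skipped between consecutive scanned images.

    image_to_page maps an image number to the printed page it shows.
    """
    missing = []
    # Walk the images in scan order and compare each page with the next one.
    ordered = sorted(image_to_page.items())
    for (_, prev_page), (_, next_page) in zip(ordered, ordered[1:]):
        # Any page strictly between two neighbouring images was never scanned.
        missing.extend(range(prev_page + 1, next_page))
    return missing


# From the post: image 581 is page 555, image 582 is page 558.
print(find_missing_pages({581: 555, 582: 558}))  # [556, 557]
```

A check like this only catches pages missing between two scanned images; it would not notice pages dropped from the very end of a book.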

It's not a huge deal, I had a sorta copy with me, but just of the first 300 pages (sometimes Memoirs of a Physician is broken down into Joseph Balsamo/MemPhys). These are kinda important pages that can be hard to track down (after all, the chapter on page 558 is titled "Doom"), so if anyone can redo those two again, or point me in some direction, I'd be much obliged.

Else, well, guess people'll have to wait 'til next week or whenever Amazon sends me a copy.

Cheers and happy holidays.

Poster: brewster (Administrator, Curator, or Staff) Date: Dec 27, 2004 2:59pm
Forum: etree Subject: Re: Help! Dumas Works 07 is Missing Pages 556-557

Gosh, thank you ever so much. Kind of scary that we missed that page. We are trying to perfect our processes in Toronto.

If you QA a book and it checks out, please post a review for the book saying so. If we have more errors like this, then maybe we should put it into our workflow.

-brewster
Digital Librarian

Poster: dmoynihan Date: Dec 27, 2004 4:39pm
Forum: etree Subject: Re: Help! Dumas Works 07 is Missing Pages 556-557

Well geez, that's fast! Thanks, and I'll just split the text up for now (into Balsamo/Memoirs).

I'm actually going through the entire Marie Antoinette series (Works Dumas 7-9), but that's the first error I've seen.

/They're two-column books from the 19th Century, and I know from experience whenever I put something like that up at the Distributed Proofers, they send death threats.

Semi-related, as part of the Million Books Project, you've got With Fire & Sword plus The Deluge, vii by Henry Sienki--Seinke--Sien--the 1905 Nobel Laureate. These are much cleaner scans, and I'm on them as well, but I happen to have Volume I of the Deluge (as well as Pan Michael)... completes the trilogy, and are among the most requested (non-pulp) books around.

Would you like the images for V I of the Deluge? I can give an Abbyy set of tifs. It's a good copy I'm working on... none of that A.L. Burt stuff here.

Poster: molly (Administrator, Curator, or Staff) Date: Dec 27, 2004 2:43pm
Forum: etree Subject: Re: Help! Dumas Works 07 is Missing Pages 556-557

I've got my scanners on it; hopefully we'll have this corrected within the next week. If you are ever feeling fruity and notice more errors in the Toronto collection, you can email me at molly AT archive.org (if it's missing pages) or submit an error report (if it's something small like a mistake in metadata). We appreciate the free QA work!

-m

Poster: simon c (Administrator, Curator, or Staff) Date: Dec 27, 2004 11:23am
Forum: etree Subject: Re: Help! Dumas Works 07 is Missing Pages 556-557

I'll bring this to the attention of the correct people at the Archive, thanks!

Poster: molly (Administrator, Curator, or Staff) Date: Jan 21, 2005 1:18am
Forum: etree Subject: Re: Help! Dumas Works 07 is Missing Pages 556-557

Fixed!

Poster: dmoynihan Date: Feb 8, 2005 5:04pm
Forum: etree Subject: Re: Help! Dumas Works 07 is Missing Pages 556-557

Cool, thanks for all that.

I split the title up when I proofed.

Volume one (Memoirs of a Physician) is here, if you guys want it--already been corrected once... sure there will be more changes, that's an evil OCR.

Volume 2, (Joseph Balsamo), is available here.

These'll show up on Gutenberg eventually, but if you want them earlier, enjoy. And thanks for fixing those images.

Poster: Brad Leblanc (Administrator, Curator, or Staff) Date: Dec 28, 2004 2:01am
Forum: etree Subject: Re: Disappearing Uploads - no shows are missing

rayd8 wrote:
This is a recurring problem that I've tried to bring to a curator's attention before

Please enter this as a "Bug Report" if you have not done so already.

I sent another email to the engineers to see if they have had a chance to look into it.

FWIW, the problem is not related to uploads disappearing; the issue is that the search engine just doesn't seem to be running against a full index of all the shows. If you browse the recordings, nothing is missing.
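
One way to picture the mismatch between browsing and searching is as a set difference: anything that appears in the browse listing but not in the search results is an item the index has missed. The sketch below is only an illustration under that assumption. The function name `unindexed_items` and the placeholder identifiers are hypothetical; only the id 20554 comes from this thread, and nothing here reflects how the Archive's search engine actually works.

```python
def unindexed_items(browsable_ids, search_result_ids):
    """Return ids that can be browsed to but never come back from search."""
    # Items present in the browse listing but absent from the index.
    return sorted(set(browsable_ids) - set(search_result_ids))


# "20554" is the show id from the original post; the other ids are
# made-up placeholders standing in for shows that are indexed correctly.
browse_listing = ["20554", "example-show-1", "example-show-2"]
search_results = ["example-show-1", "example-show-2"]

print(unindexed_items(browse_listing, search_results))  # ['20554']
```

Gathering the two lists is the labor-intensive part; the comparison itself is trivial once both exist.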

Sorry this hasn't been fixed yet. I hope they look into this and comment soon.

-Brad

This post was modified by Brad Leblanc on 2004-12-28 10:01:02

Poster: simon c (Administrator, Curator, or Staff) Date: Dec 28, 2004 11:08am
Forum: etree Subject: Re: Disappearing Uploads - no shows are missing

We do have a fix for this which requires Unix access and to know the URL of the item affected - mail me if you want more info, Brad. However, we just found a possible glitch for the fix, so.. we should probably wait before using it further.

Poster: xtifr Date: Dec 28, 2004 4:52pm
Forum: etree Subject: Re: Disappearing Uploads - no shows are missing

simon c wrote:
We do have a fix for this which requires Unix access and to know the URL of the item affected [...]

That's...extremely unfortunate. There seem to be dozens, if not hundreds, of these lost-between-the-cracks shows, and finding the URL of even a single one is pretty labor intensive. And since it seems to keep happening, the problem is just going to get worse....

This may be a FAQ, but have you guys thought about opening the source for the code behind the Archive? Remember the Open Source Motto: "many eyes make all bugs shallow." And as an experienced UNIX programmer myself, I'd think my available spare volunteer time could be more productively spent reviewing the code, looking for the causes of bugs, rather than in doing tedious manual searches through the data to try to find their effects. Especially if the latter is going to be a never-ending search.

Poster: pt Date: Dec 29, 2004 3:11am
Forum: etree Subject: Re: Disappearing Uploads - no shows are missing

xtifr: This is a good suggestion. We believe strongly in open source and in principle agree that the Archive would be better if we open sourced all of our code. However, in practice, we're not in a good position right now to package it up into a sourceforge project. It wasn't designed with that in mind.

However, the part that is causing the indexing problem is our search engine/index, which appears to be ignoring certain requests to index items (shows). This is something we should be running off of sourceforge and I'll see what I can do about getting it up there soon. It would be great to have other eyes on the source to help us ferret out a few of the bugs we know are still lurking.